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Image compression 

The present invention relates to a method and apparatus for compressing a digitised 
image. 

5 

Currently, the most widely used standard for the compression of continuous-tone still 
images, both greyscale and colour, is that known under the acronym JPEG, after the Joint 
Photographic Experts Group. JPEG specifies inter alia a discrete cosine transform (DCT) 
based method for the lossy compression of still images. 

10 

A DCT based encoder 1 employing JPEG is shown schematically in Figure 1 (see 
Wallace, G.K. "The JPEG Still Picture Compression Standard" Communications of The 
ACM, April 1991). For a digitised greyscale image 2 which is composed of a matrix of 
pixel intensity values (e.g. 512X480), the image is first subdivided into 8x8 pixel blocks 3. 

15 The pixel blocks 3 are fed in sequence to the encoder 1 which has at its input a forward 
DCT (FDCT) unit 4. The DCT is related to the discrete Fourier transform (DFT), such that 
the FDCT unit 4 effectively transforms each 8x8 block 3 into 64 orthogonal basis signals 
or DCT coefficients, each of which corresponds to one of 64 "spatial frequencies ". In 
effect, the DCT represents the frequency spectrum of the input block 3. The DCT 

20 coefficient with zero frequency in both dimensions is the "DC coefficient and the 

remaining 63 coefficients are the "AC n coefficients. It is a general property of images that 
pixel intensity values vary slowly from pixel to pixel across the image. Thus, for a typical 
8x8 sample block 3 from a typical source image, most of the DCT coefficients have zero 
or near-zero amplitude. 

25 

Following the FDCT unit 4, each DCT is provided to a quantiser unit 5 which quantises 
the DCT coefficients using a 64-element quantisation table stored in a table specification 
memory 6. The elements of the quantisation table specify the quantisation step size for 
respective DCT coefficients. In practice, the quantisation table used is either a 'base' 

30 table stored in the memory 6 (CCITT Recommendation T.81 , "Digital Compression and 
Coding of Continuous-Tone Still Images - Requirements and Guidelines", Annex K) or a 
table generated by uniformly scaling the elements of the base table. Typically, 100 
different tables are defined corresponding to a range of quality levels, Q=1 to 100, where 
the base table corresponds to a quality level Q=50. It is noted that within any one 

35 quantisation table, the quantisation step sizes may vary from element to element. 
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After quantisation, the quantised coefficients of each DCT are provided to a run length 
encoder 7 as a data stream, the coefficients being ordered according to increasing 
frequency. The run length encoder 7 takes advantage of consecutive zeros in this data 
5 stream to compress the data. As has already been noted, for a typical image block the 
DCT coefficients tend to be small so that, after quantisation, the number of zeros in the 
DCT data stream is likely to be great. Run-length encoding may therefore achieve a 
significant level of compression. Finally, the run length encoded data stream is provided 
to an entropy encoder 8, which further compresses the data stream using, for example, 
1 0 Huffman coding, to generate a compressed Image' 9. 

In many applications, the number of bits which can be used to represent a compressed 
image is predefined. One such application is the proposed transmission of still images 
via the short message service (SMS) provided for by the cellular telephone standard 

15 GSM. The maximum length of a single concatenated short message is 34170 (255x134) 
octets (or bytes). In order to allow a still image to be transmitted by a single concatenated 
short message, the compressed image must therefore occupy less than 34170 octets. 
However, because the spectral characteristics of different images can differ greatly, it is 
extremely hard to predict the size of the compressed image which will be produced by 

20 using a particular quantisation table (or Q value). The general practice in order to satisfy 
a predefined bit budget is to select a quantisation table on the basis of past experience, 
and to apply this to obtain a compressed image. If the compressed image does not 
satisfy the bit budget, a second quantisation table is selected via a quantisation table 
selection unit 10 and a new compressed image generated. This process is carried out on 

25 a "trial and error basis until a compressed image is obtained which satisfies the bit 
budget. 

It will be appreciated that the trial and error nature of the compression method outlined 
above is inefficient in so far as the quantisation and encoding steps often have to be 
30 repeated many times before a compressed image is obtained which satisfies the bit 
budget. 

According to a first aspect of the present invention there is provided a method of 
compressing a digitised image composed of a matrix of image samples to provide a 
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compressed image which satisfies a predefined bit budget, the method comprising the 
steps of: 

1) dividing the digitised image into blocks and deriving for each block an energy- 
packing transform comprising a set of transform coefficients; 

2) selecting a quantisation table from a set of quantisation tables and using the 
selected table to quantise the coefficients of each transform; 

3) deriving a zero-value index indicative of the number of zero-value quantised 
transform coefficients; 

4) determining a predicted zero-value index using said predefined bit budget; 

5) selecting a quantisation table from said set of tables using the derived index 
and said predicted index and using that selected table to quantise the coefficients of each 
transform; and 

6) compressing the coefficients quantised in step 5) using run-length encoding. 

Embodiments of the present invention enable a compressed image to be generated, 
which satisfies a predefined bit budget, using only a single encoding step 6). 

The compressed image may be divided into blocks of any suitable size. The blocks may 
be contiguous or overlapping. Typically however, the blocks are contiguous and each is 
composed of 8x8 pixels. 

Preferably, said energy-packing transform is a discrete Fourier transform (DCT). 
Alternative energy-packing transforms may be used however, such as a Karhunen-Loeve 
transform. 

For a greyscale image, said image samples are greyscale intensity values. For a colour 
image, a matrix of image samples may be provided for each of a set of colours, e.g. red, 
blue, and green, and the matrices processed separately according to the method of the 
invention. The compressed image comprises a combination of the compressed 
coefficient sets. Alternatively, and in order to increase the compression ratio further, 
luminance (Y) and chrominance (U,V) matrices may be generated from the red, blue and 
green colour matrices. Again, the luminance and chrominance matrices are processed 
separately according to the above method. 
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Preferably, said zero-value index is the average number of zero-value quantised 
transform coefficients in the transforms. Alternatively however, the median or another 
representative value may be used. 

5 Preferably, step 3) comprises deriving zero-value indices for each of a plurality of different 
quantisation tables to provide a zero-value index vs quantisation table relationship. It is 
however not necessary to requantise the transform coefficients for each additional 
quantisation table. Rather, the additional zero-value indices can be derived from the first 
obtained set of quantised coefficients. 

10 

Preferably, the method comprises obtaining a reference zero-value index vs bit budget 
relationship by: 

7) dividing a digitised test image into blocks and deriving for each block an energy 
packing transform comprising a set of transform coefficients; 
15 8) selecting a quantisation table from a set of quantisation tables and using the 

selected table to quantise the coefficients of each transform; 

9) deriving a zero-value index indicative of the number of zero-value quantised 
transform coefficients; 

10) compressing the coefficients quantised in step 5) using run-length encoding; 
20 11) determining the bit size of the compressed image; 

12) repeating steps 7) to 1 1) for a plurality of different quantisation tables to obtain 
a zero-value vs bit size relationship for the test image; and 

13) repeating steps 7) to 12) for a plurality of different test images and combining 
the resulting relationships to obtain the reference zero-value index vs bit budget 

25 relationship, 

wherein this relationship is used in step 4) to determine the predicted zero-value 
index using the predefined bit budget. 

Step 5) then comprises using the predicted zero-value index and the zero-value index vs 
30 quantisation table relationship for the image to be compressed, to select the quantisation 
table. This selection may comprise interpolating between the zero-values of the derived 
set of zero-values which neighbour the predicted zero-value. 

In order to ensure that the final compressed coefficient set satisfies the predefined bit 
35 budget, the selection of the final quantisation table is preferably a conservative one. For 
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example, the predefined bit budget may actually be less than the actual number of bits 
which can be transmitted, stored or otherwise processed. 

Preferably, steps 6) and 10) comprise entropy encoding the data, e.g. using Huffman 
encoding, following run-length encoding. 

According to a second aspect of the present invention, there is provided apparatus for 
compressing a digitised image composed of a matrix of image samples to provide a 
compressed image which satisfies a predefined bit budget, the apparatus comprising: 

first signal processing means for dividing the digitised image into blocks and for 
deriving for each block an energy packing transform comprising a set of transform 
coefficients; 

quantisation means for quantising the coefficients of each transform using a first 
quantisation table selected from a set of quantisation tables; 

second signal processing means for deriving an index representative of the 
number of zero-value quantised transform coefficients, for determining a predicted zero- 
value index using said predefined bit budget, for selecting a quantisation table from said 
set of tables using the derived index and said predicted index and using that selected 
table to quantise the coefficients of each transform; and 

encoding means for compressing the coefficients quantised by the second signal 
processing means using run-length encoding. 

The apparatus of the present invention may be incorporated into a mobile communication 
device, for example a cellular phone. 

For a better understanding of the present invention and in order to show how the same 
may be carried into effect reference will now be made, by way of example, to the 
accompanying drawings, in which: 

Figure 1 shows a block diagram of a DCT-based lossy encoder according to the 

prior art; 

Figure 2 shows a block diagram of a DCT-based lossy encoder according to an 
embodiment of the present invention; 

Figure 3 shows byte-size vs zero-value index relationships for respective test 
images; 
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6 

Figure 4 shows the zero-value indices obtained with a number of different 
quantisation tables (Q) for an image to be compressed; and 

Figure 5 is a flow diagram illustrating a method of compressing an image. 

5 The general architecture of a DCT-based encoder embodying the present invention is 
shown in Figure 2. This is a modification of the encoder already described with reference 
to Figure 1, and like parts are identified with the same reference numbers. The encoder 
is suitable for use in the compression of still frames in accordance with the JPEG 
standard, although it may also be used to satisfy other compression standards and 
10 methods. A conventional decoder may be used to decode images compressed with this 
encoder. 

The encoder of Figure 2 comprises a modified quantisation table selection unit 1 1 . This is 
arranged to store a look-up table or other representation of the relationship between the 

15 size of a compressed image (termed the 'bit budget') and an index termed the 'zero-value 1 
index. As has already been explained above, the DCT obtained for each block 3 of an 
image 2 contains a set of DCT coefficients, a large number of which may be zero after 
quantisation. For a given image, the zero-value index of that image is determined by 
counting the number of zero-value quantised coefficients in each DCT, and determining 

20 the average number of zero-value coefficients per DCT. 

The stored relationship is constructed using a number of library or test images, chosen to 
be representative of a number of different styles, for example images containing little 
detail, e.g. sky, and images containing a large amount of detail, e.g. landscapes. Each 
25 test image is processed by sub-dividing the image into blocks and determining the DCT 
for each block. This set of DCTs is then quantised with each of the quantisation tables 
(Q=1 to 100) in turn, where the tables are generated from the base table (Q=50) using the 
following relationships: 



30 f or Q < 50 ; k 



5000 
Q 



TB Q [ij] 



TB S0 [ij].k + 50 
100 



for £>50; k 



200-2(2; 



TB Q [ij] 



TB 50 [i,j]k + 50 
100 
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where TB^ij] is the quantisation step for the base table element in the / th row and the 
j th column and TB Q [ij] is the quantisation step for the new table element in the i th 
row and the j th column. TB Q [ij] are in practice also rounded to the nearest integer. 

5 The resulting quantised DCTs are encoded using run-length encoding and entropy 

encoding (i.e. Huffman encoding). The size of the compressed image (bit budget) is then 
determined. In addition, the zero-value index is calculated for each set of quantised 
DCTs. Figure 3 shows the zero-value index vs bit budget for a number of different test 
images. This set of relationships is then averaged to generate an archetypal relationship. 

10 It is found that in general the zero-value index vs bit budget relationship of an image 
deviates little from the archetypal relationship. 

The archetypal relationship is stored as already described in the modified quantisation 
table selection unit 1 1 , typically as a look up table. A new digitised image to be 
15 compressed is passed, block by block, to the FDCT 4 to generate a DCT for each block. 
A quantisation table corresponding to Q=97 is generated by the modified quantisation 
table selection unit 1 1 , from the table specification memory 6, for use by the quantiser 
unit 5 to quantise each DCT in turn. The number of zero-value coefficients in each DCT 
is then determined and the zero-value index calculated. 

20 

Using the set of quantised DCTs, it is also possible to determine the zero-value index for 
lower Q values. For example, it can be seen that all coefficients TB 91 [ij] having the 
value 1 will have the value 0 after quantisation with Q=91. It is therefore only necessary 
to count the number of O's and Vs in each DCT with Q=97, and find the average per 
25 block, to determine the zero-value index for Q=91 . There is shown in Figure 4 a plot of Q 
value vs zero-value index for one particular image. 

Assuming that a particular bit budget has been predefined (BB) for the compression 
stage, the relationship illustrated in Figure 3 and held by the quantisation table selection 
30 unit 1 1 , can be used to identify that zero-value index (a 'predicted' zero-value index) 

which will meet the bit budget. The identified zero-value index can then in turn be used, 
in conjunction with the specific relationship of Figure 4, to identify the Q value which will 
achieve this bit budget for the image under consideration. The quantisation table 
corresponding to that Q value is then generated and applied by the quantiser unit 5 to 



WO 99/04553 



PCT/FI98/00512 



8 

quantise the set of DCTs for the image. The quantised DCTs are then applied to the rui 
length encoder 7 and to the entropy encoder 8 as already described to generate the 
compressed image. 

5 Figure 5 is a flow diagram illustrating the method described above. 

It will be appreciated by the skilled person that modifications may be made to the above 
described embodiment without departing from the scope of the present invention. 
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1 . A method of compressing a digitised image composed of a matrix of image 
samples to provide a compressed image which satisfies a predefined bit budget, the 
method comprising the steps of: 

1) dividing the digitised image into blocks and deriving for each block an energy 
packing transform comprising a set of transform coefficients; 

2) selecting a quantisation table from a set of quantisation tables and using the 
selected table to quantise the coefficients of each transform; 

3) deriving a zero-value index indicative of the number of zero-value quantised 

transform coefficients; 

4) determining a predicted zero-value index using said predefined bit budget; 

5) selecting a quantisation table from said set of tables using the derived index 
and said predicted index and using that selected table to quantise the coefficients of each 
transform; and 

6) compressing the coefficients quantised in step 5) using run-length encoding. 

2. A method according to claim 1 , wherein said zero-value index is the average 
number of zero-value quantised transform coefficients per transform. 

3. A method according to claim 1 or 2, wherein step 3) comprises deriving zero-value 
indices for each of a plurality of different quantisation tables to provide a zero-value index 
vs quantisation table relationship. 

4. A method according to any one of the preceding claims and comprising obtaining 
a reference zero-value index vs bit budget relationship by: 

7) dividing a digitised test image into blocks and deriving for each block an energy 
packing transform comprising a set of transform coefficients; 

8) selecting a quantisation table from a set of quantisation tables and using the 
selected table to quantise the coefficients of each transform; 

9) deriving a zero-value index indicative of the number of zero-value quantised 
transform coefficients; 

10) compressing the coefficients quantised in step 5) using run-length encoding; 

11) determining the bit size of the compressed image; 
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12) repeating steps 7) to 1 1) for a plurality of different quantisation tables to obtain 
a zero-value vs bit size relationship for the test image; and 

13) repeating steps 7) to 12) for a plurality of different test images and combining 
the resulting relationships to obtain the reference zero-value index vs bit budget 

5 relationship, 

wherein this relationship is used in step 4) to determine the predicted zero-value 
index using the predefined bit budget. 

5. A method according to claim 4 when appended to claim 3, wherein step 5) 

10 comprises using the predicted zero-value index and the zero-value index vs quantisation 
table relationship for the image to be compressed, to select the quantisation table. 

6. A method according to any one of the preceding claims, wherein said energy 
packing transform is a discrete cosine transform (DCT). 

15 

7. A method according to any one of the preceding claims, wherein step 6) 
comprises entropy encoding the data after run-length encoding. 

8. Apparatus for compressing a digitised image composed of a matrix of image 
20 samples to provide a compressed image which satisfies a predefined bit budget, the 

apparatus comprising: 

first signal processing means for dividing the digitised image into blocks and for 
deriving for each block an energy packing transform comprising a set of transform 
coefficient; 

25 a quantisation table specification memory (6) storing a set of quantisation tables; 

quantisation means (5) for quantising the coefficients of each transform using a 
first quantisation table selected from said set of quantisation tables; 

second signal processing means (11) for deriving an index representative of the 
number of zero-value quantised transform coefficients, for determining a predicted zero- 
30 value index using said predefined bit budget, for selecting a quantisation table from said 
set of tables using the derived index and said predicted index and using that selected 
table to quantise the coefficients of each transform; and 

encoding means (7,8) for compressing the coefficients quantised by the second 
signal processing means using run-length encoding. 

35 
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encoding and Huffman encoding. 
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